Taguchi’s T-method is a new prediction technique under the Mahalanobis-Taguchi
system that predicts unknown outputs or future states based on
available historical information. Conventionally, in optimizing the T-method
prediction accuracy, Taguchi’s orthogonal array is utilized to determine a
subset of significant features to be used in formulating the optimal prediction
model. This, however, results in sub-optimal prediction accuracy because the
orthogonal array offers only a fixed and limited set of feature combinations for
evaluation and does not account for higher-order feature interactions. In this
paper, a swarm-based binary bat optimization algorithm with a nearest integer
discretization approach is integrated with Taguchi’s T-method. The proposed
method is compared against the conventional approach on four benchmark case
studies, using mean absolute error as the performance measure. The experimental
results show a significant improvement in the T-method prediction accuracy,
and the reduction in the total number of features results in a less
complex model. In general, the nearest integer-based binary bat algorithm
successfully optimized the selection of significant features owing to its
recursive and repetitive search ability and its adaptive response to the
current best solution, which guides the search process towards optimality.
1. INTRODUCTION

A predictive analytic technique is an essential tool in obtaining information about the future state or 
unknown outcome by analyzing current or historical data. The growth of big data, the availability of efficient 
software and hardware for data processing, and a competitive and agile business environment have motivated 
the development of various prediction approaches [1]. Some examples include Bayesian network, artificial 
neural network, multiple linear regression, logistic regression, and machine learning algorithms such as 
random forest [2], [3]. One of the relatively new predictive modeling techniques that is capable of producing
an accurate outcome is Taguchi’s T-method (T-method), which was introduced by Dr. Genichi Taguchi
under the Mahalanobis-Taguchi system (MTS) [4]. Of the many methods under the MTS architecture, the
T-method is designed explicitly for solving prediction-based problems involving multivariate information.
Nevertheless, the T-method is also capable of performing classification tasks with the aid of interval values
computed according to classified groups or items. Fundamentally, the T-method is developed based on the
combination of the regression principle and Taguchi’s robust quality engineering elements in formulating a
predictive model. The blend of mathematical-statistical theory, such as linear regression and the weighted
average model, with the unit space element and orthogonal array (OA) experimental design is the main
factor that distinguishes the T-method from other available predictive techniques.

As a multivariate prediction technique, the T-method predictive model is formulated with multiple 
input features to predict the output. As a consequence, the prediction model becomes complex as the 
dimensionality of data gets higher, and in many situations, not all input features are significant and relevant 
towards the prediction outcome [5]. Conventionally, in optimizing the T-method predictive model, an OA
design is employed to determine a subset of significant input features, using the maximum value of
the integrated estimate signal-to-noise ratio as the performance measure. Researchers [6] and [7] highlighted
that the application of the OA in performing feature selection is insufficient and yields a sub-optimal solution.
This is mainly due to the OA's fixed and limited variable combinations, which limit the possibility of finding
the optimal solution [8]. In addition, Kim et al. [9] stressed that the OA does not exploit higher-order
interactions (mixtures) between variables, leading to sub-optimal prediction accuracy. In response to these
concerns, this paper proposes the utilization of a swarm-based binary bat optimization algorithm (BBA) with
the nearest integer discretization approach as an alternative to the OA.

The utilization of metaheuristic algorithms such as the BBA for feature selection optimization
problems is not new. Many studies have reported the success of metaheuristic algorithms in solving
combinatorial problems, as demonstrated in [10], [11]. The many metaheuristic algorithms available for
consideration are categorized into four groups according to their search behavior: evolution-based,
swarm intelligence-based, physics-based, and human-related algorithms [12]. The BBA belongs to the swarm
intelligence-based group, along with particle swarm optimization (PSO), ant colony optimization, honey bee
swarm optimization, cuckoo search optimization, and many others. Specifically, in optimizing the
T-method prediction accuracy, Harudin et al. [13] successfully employed the artificial bee colony algorithm for
feature selection optimization. The result showed an improvement in prediction accuracy compared to the
conventional OA approach. In a different study, Harudin et al. [14] utilized a modified artificial bee colony
algorithm with a binary bitwise operator as the feature selection approach, and the outcome recorded an
enhancement in prediction accuracy. These studies have shown the practicality of employing metaheuristic
algorithms for T-method feature selection optimization.


2. METHOD 

The development of an optimal T-method prediction model consists of two main phases. The first
phase focuses on the development of the basic prediction model, while the second phase concentrates on the
optimization of the formulated model through a feature selection process. This paper focuses on the second
phase, replacing the conventional approach using Taguchi’s OA with the nearest integer-based binary bat
algorithm.


2.1. Development of the basic T-method prediction model 

Prior to optimizing the T-method prediction model using the proposed approach, the basic
prediction model must be established. The development of the basic T-method predictive model involves the
determination of two important model parameters, known as the proportional coefficient, β, and the
signal-to-noise ratio (SNR), η. In estimating the model’s parameters, the raw data is first transformed into
signal data through a normalization process using the average of the unit space data. Unit space is a concept
emphasized in the Mahalanobis-Taguchi system that represents a homogeneous population against the target
group [15]. As such, a subset of homogeneous output data located in a densely populated region is selected,
and the average value is computed for the output and the respective input features. Normalization of the signal
data is performed by subtracting the average value from the raw data, and the unit space data is excluded from
the signal data in further computation. As a result of normalization, the prediction model in the form of a
linear regression line has a zero intercept (it passes through the origin). Theoretically, the T-method
predictive model is represented as an integrated estimate output value that measures the predicted outcome,
as shown in (1). It is formed by combining all input features, weighting each by its respective model
parameters using a weighted average approach.


$$\text{Integrated estimate output value, } \hat{M}_i = \frac{\eta_1 \times \dfrac{X_{i1}}{\beta_1} + \eta_2 \times \dfrac{X_{i2}}{\beta_2} + \dots + \eta_k \times \dfrac{X_{ik}}{\beta_k}}{\eta_1 + \eta_2 + \dots + \eta_k} \tag{1}$$

where $X_{ij}$ is the normalized value of the j-th input feature for the i-th signal data and k is the number
of features. The model’s parameters are estimated using (2) and (4):

$$\text{Proportional coefficient, } \beta_j = \frac{M_1 X_{1j} + M_2 X_{2j} + \dots + M_l X_{lj}}{r} \tag{2}$$

where j is the input feature, M is the normalized output of the signal data, l is the number of signal data,
and r is the effective divider computed using (3).

$$\text{Effective divider, } r = M_1^2 + M_2^2 + \dots + M_l^2 \tag{3}$$

$$\text{SNR, } \eta_j = \begin{cases} \dfrac{\frac{1}{r}\left(S_{\beta j} - V_{ej}\right)}{V_{ej}}, & \text{when } S_{\beta j} > V_{ej} \\ 0, & \text{when } S_{\beta j} \le V_{ej} \end{cases} \tag{4}$$

where $V_{ej}$ is the error variance obtained by (5) until (8).

$$\text{Error variance, } V_{ej} = \frac{S_{ej}}{l - 1} \tag{5}$$

$$\text{Error variation, } S_{ej} = S_{Tj} - S_{\beta j} \tag{6}$$

$$\text{Total variation, } S_{Tj} = X_{1j}^2 + X_{2j}^2 + \dots + X_{lj}^2 \tag{7}$$

$$\text{Variation of proportional term, } S_{\beta j} = \frac{\left(M_1 X_{1j} + M_2 X_{2j} + \dots + M_l X_{lj}\right)^2}{r} \tag{8}$$


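An integrated estimate SNR (db) is computed using (9) until (15) to quantitatively represent the
model’s quality once the T-method prediction model has been established and the predicted value,
$\hat{M}$, has been obtained.

$$\text{Integrated estimate SNR (db), } \eta_{est} = 10 \log \left[ \frac{\frac{1}{r}\left(S_{\beta} - V_{e}\right)}{V_{e}} \right] \tag{9}$$

$$\text{Linear equation, } L = M_1 \hat{M}_1 + M_2 \hat{M}_2 + \dots + M_l \hat{M}_l \tag{10}$$

$$\text{Effective divider, } r = M_1^2 + M_2^2 + \dots + M_l^2 \tag{11}$$

$$\text{Total variation, } S_T = \hat{M}_1^2 + \hat{M}_2^2 + \dots + \hat{M}_l^2 \tag{12}$$

$$\text{Variation of proportional term, } S_{\beta} = \frac{L^2}{r} \tag{13}$$

$$\text{Error variation, } S_e = S_T - S_{\beta} \tag{14}$$

$$\text{Error variance, } V_e = \frac{S_e}{l - 1} \tag{15}$$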
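
The parameter estimation in (1) until (15) can be sketched in a few lines of Python. This is only an illustrative sketch, not the implementation used in the study: the function names (`fit_t_method`, `predict`, `integrated_snr_db`) and the toy data are hypothetical, the inputs are assumed to be already normalized by the unit space average, and the error variance of the integrated estimate is assumed to be S_e / (l - 1), by analogy with (5).

```python
import math

def fit_t_method(X, M):
    """Estimate beta_j and eta_j per feature from normalized signal data.
    X: list of l rows with k feature values each; M: list of l outputs."""
    l, k = len(X), len(X[0])
    r = sum(m * m for m in M)                       # effective divider, Eq. (3)
    beta, eta = [], []
    for j in range(k):
        col = [row[j] for row in X]
        lin = sum(m * x for m, x in zip(M, col))    # M_1*X_1j + ... + M_l*X_lj
        s_t = sum(x * x for x in col)               # total variation, Eq. (7)
        s_b = lin * lin / r                         # proportional term, Eq. (8)
        v_e = (s_t - s_b) / (l - 1)                 # error variance, Eqs. (5)-(6)
        beta.append(lin / r)                        # Eq. (2)
        # Eq. (4): SNR is set to zero when S_beta does not exceed V_e
        eta.append((s_b - v_e) / (r * v_e) if s_b > v_e else 0.0)
    return beta, eta

def predict(X, beta, eta):
    """Integrated estimate output value, Eq. (1); zero-SNR features drop out.
    Assumes at least one feature has a positive SNR."""
    total = sum(eta)
    return [sum(e * x / b for e, x, b in zip(eta, row, beta) if e > 0) / total
            for row in X]

def integrated_snr_db(M, M_hat):
    """Integrated estimate SNR in db, Eqs. (9)-(15)."""
    l = len(M)
    r = sum(m * m for m in M)                       # Eq. (11)
    lin = sum(m * mh for m, mh in zip(M, M_hat))    # Eq. (10)
    s_t = sum(mh * mh for mh in M_hat)              # Eq. (12)
    s_b = lin * lin / r                             # Eq. (13)
    v_e = (s_t - s_b) / (l - 1)                     # Eqs. (14)-(15), assumed form
    return 10 * math.log10((s_b - v_e) / (r * v_e))

# Hypothetical toy signal data: two informative features and one noise feature.
M = [-2.0, -1.0, 1.0, 2.0, 3.0]
X = [[-4.1, 1.8, 0.5], [-1.9, 1.1, -0.3], [2.2, -0.9, 0.4],
     [3.9, -2.2, -0.2], [6.1, -2.9, 0.1]]
beta, eta = fit_t_method(X, M)
M_hat = predict(X, beta, eta)
snr = integrated_snr_db(M, M_hat)
print(beta, eta, snr)
```

In this toy example the third feature receives a zero SNR under the condition in (4), so it contributes nothing to the weighted average in (1).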


2.2. Taguchi’s orthogonal array for feature selection 

In brief, Taguchi’s orthogonal array is a fractional orthogonal design with a predetermined design
array proposed by Dr. Genichi Taguchi. It significantly reduces the number of experimental runs, allowing
for efficient sampling of a multidimensional design space [16]. From the perspective of the feature selection
process, however, the use of a fractional orthogonal design with a predetermined design array suggests that
optimality of the T-method prediction model may not be achieved using Taguchi’s orthogonal array, because
the selection process is not dynamic: it does not evaluate numerous potential feature combinations before
returning the optimal (or near-optimal) solution. Nevertheless, Taguchi’s orthogonal array offers fast
computation and low experimental cost by utilizing a balanced predetermined design array. Specifically for
feature selection optimization, a 2-level orthogonal array is used to represent the ‘used’ and ‘not used’
states of each feature in the combination. For the T-method, the selection procedure of significant input
features comprises four steps. The first step is the selection of a suitable OA design based on the number of
input features. The second step is the estimation of the integrated estimate SNR (db) for each combination in
the selected OA using (1) until (15). The third step is the estimation of the average value of the
integrated estimate SNR (db) for each level or state of each feature. The last step is the evaluation of the
average differences between levels, which can be done by developing a factorial effect chart to visualize the
differences. A reduction in the average integrated estimate SNR (db) from the ‘used’ state to the ‘not used’
state of a feature signifies that the feature plays an important role in contributing to the prediction outcome.
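
The third and fourth steps of this procedure can be illustrated with a short Python sketch. It uses the standard L8 two-level orthogonal array for seven features, and the SNR responses are hypothetical numbers chosen only for illustration; in practice each response would be the integrated estimate SNR (db) of the T-method model built from the features marked as ‘used’ in that row. The function name `select_features` is also an assumption.

```python
# Standard L8 (2^7) orthogonal array: level 1 = feature used, level 2 = not used.
L8 = [
    [1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 2, 2, 2, 2],
    [1, 2, 2, 1, 1, 2, 2],
    [1, 2, 2, 2, 2, 1, 1],
    [2, 1, 2, 1, 2, 1, 2],
    [2, 1, 2, 2, 1, 2, 1],
    [2, 2, 1, 1, 2, 2, 1],
    [2, 2, 1, 2, 1, 1, 2],
]

def select_features(oa, snr_db):
    """Return 1-based indices of features whose average SNR is higher
    in the 'used' state than in the 'not used' state."""
    selected = []
    for j in range(len(oa[0])):
        used = [s for row, s in zip(oa, snr_db) if row[j] == 1]
        unused = [s for row, s in zip(oa, snr_db) if row[j] == 2]
        # Difference between level averages (the factorial effect of feature j)
        gain = sum(used) / len(used) - sum(unused) / len(unused)
        if gain > 0:
            selected.append(j + 1)
    return selected

# Hypothetical integrated estimate SNR (db) for the eight OA rows.
snr_db = [10.0, 12.0, 9.0, 8.0, 11.0, 10.0, 7.0, 8.0]
print(select_features(L8, snr_db))
```

Only eight of the 2^7 = 128 possible feature subsets are ever evaluated here, which is the fixed and limited coverage discussed above.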

2.3. Binary bat algorithm

The consideration of using a swarm-based metaheuristic algorithm to optimize the T-method's
feature selection process is largely due to its ability to offer an optimal (or near-optimal) solution. Some of
its prominent traits are the ability to avoid premature convergence through exploration and exploitation of
the solution space, stochastic and adaptive behavior in dealing with complex problems, simplicity, ease of
implementation, and a reasonable computation cost [12]. Specifically, the bat algorithm is selected,
which was developed based on the natural echolocation ability of microbats to identify prey, discriminate
between different types of targets, and avoid obstacles [17]. Conceptually, microbats generate a very loud
sound pulse with varying properties. The echo that bounces back from surrounding objects is listened to and
processed for various purposes, including hunting and navigation. Accordingly, three approximation rules are
suggested in the development of the bat algorithm [18]:

a. Bats utilize echolocation to sense distance and can distinguish between food, prey, and background
barriers in a miraculous manner;

b. Bats search for prey by flying at a random velocity, $v_i$, at position, $x_i$, with a fixed frequency,
$f_{min}$, varying wavelength, and loudness, $A_0$. Depending on the closeness of their target, bats may
automatically modify the wavelength of their emitted pulses as well as the rate of pulse emission,
$r \in [0, 1]$;

c. Although loudness can vary in a variety of ways, the loudness is assumed to range from a maximum, $A_0$,
to a minimum, $A_{min}$.

In the bat algorithm, an artificial bat, i, moves by updating its frequency, $f_i$, velocity, $v_i$, and
position, $x_i$, using (16), (17), and (18).


$$f_i = f_{min} + \left(f_{max} - f_{min}\right)\beta \tag{16}$$

$$v_i^{t} = v_i^{t-1} + \left(x_i^{t-1} - x_*\right) f_i \tag{17}$$

$$x_i^{t} = x_i^{t-1} + v_i^{t} \tag{18}$$

where $\beta \in [0, 1]$ is a random number generated from a uniform distribution (not to be confused with
the proportional coefficient in (2)), t is the time step or iteration, and $x_*$ is the current global best
solution. To further exploit the solution for optimality, a random walk is performed using (19) on the
selected best solution.

$$x_{new} = x_{old} + \varepsilon \bar{A}^{t} \tag{19}$$

where $\varepsilon \in [0, 1]$ is a random number from a uniform distribution and $\bar{A}^{t}$ is the average
loudness, A, of all bats at iteration t. The balance between exploration on the global scale and exploitation
of solutions in the targeted region of the bat algorithm is controlled by the loudness, A, and pulse emission
rate, r. Both parameters are updated as the iterations progress using (20) and (21):

$$A_i^{t+1} = \alpha A_i^{t} \tag{20}$$

$$r_i^{t+1} = r_i^{0}\left[1 - \exp(-\gamma t)\right] \tag{21}$$

where α and γ are constants. The pseudocode of the bat algorithm is available in [17].
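
To make the role of the two control parameters concrete, the following Python sketch evaluates the loudness and pulse rate schedules in (20) and (21) for a few iterations. The function names are hypothetical, and the starting values are taken from the Abalone column of Table 4 purely as an example.

```python
import math

def update_loudness(A, alpha):
    """Eq. (20): loudness shrinks geometrically each time it is updated."""
    return alpha * A

def update_pulse_rate(r0, gamma, t):
    """Eq. (21): pulse rate rises from 0 towards its initial value r0."""
    return r0 * (1.0 - math.exp(-gamma * t))

A, r0 = 0.5, 0.5            # initial loudness A_0 and pulse rate r_0 (Table 4, Abalone)
alpha, gamma = 0.9, 0.95    # constants alpha and gamma (Table 4, Abalone)

for t in range(1, 6):
    A = update_loudness(A, alpha)
    r = update_pulse_rate(r0, gamma, t)
    print(f"t={t}  loudness={A:.4f}  pulse rate={r:.4f}")
```

As the loudness decays, new solutions are accepted less often, while the growing pulse rate makes the local random walk around the best solution less frequent.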

The binary version of the bat algorithm was introduced in [19] and [20] for discrete and feature
selection optimization. Instead of moving in a continuous solution space, bats in the binary bat algorithm
move across the corners of a hypercube modeled as an n-dimensional Boolean lattice. In feature
selection optimization problems, a binary vector of ‘1’ and ‘0’ values represents the used and
not used states of each feature in a binary-string combination. As such, the position is restricted to binary
values, and updating the bat’s position means switching between the ‘1’ and ‘0’ values. The authors of [19]
and [20] suggested that switching should be done based on the velocity of the bats and proposed sigmoid and
V-shaped transfer functions, respectively, to map the continuous velocity values into a probability value
before employing a binary operator to determine the binary position of the bats. For the sigmoid transfer
function, the continuous-valued bat velocity, $v_i^k$, is transformed into a probability value between 0 and 1
using (22). The position of the bat is then determined using a binary operator, as shown in (23). Meanwhile,
the V-shaped transfer function uses (24) to transform the continuous value into a probability value and
determines the position of the bat using (25).
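
$$\text{Sigmoid function, } S\left(v_i^k(t)\right) = \frac{1}{1 + e^{-v_i^k(t)}} \tag{22}$$

$$x_i^k(t+1) = \begin{cases} 0, & \text{if } rand < S\left(v_i^k(t)\right) \\ 1, & \text{if } rand \ge S\left(v_i^k(t)\right) \end{cases} \tag{23}$$

$$\text{V-shaped function, } V\left(v_i^k(t)\right) = \left| \frac{2}{\pi} \arctan\left( \frac{\pi}{2} v_i^k(t) \right) \right| \tag{24}$$

$$x_i^k(t+1) = \begin{cases} \left(x_i^k(t)\right)^{-1}, & \text{if } rand < V\left(v_i^k(t)\right) \\ x_i^k(t), & \text{if } rand \ge V\left(v_i^k(t)\right) \end{cases} \tag{25}$$

where $v_i^k(t)$ is the velocity of the i-th bat for the k-th feature at iteration t, $x_i^k(t+1)$ is the
position of the i-th bat for the k-th feature at iteration t+1, and $\left(x_i^k(t)\right)^{-1}$ denotes the
complement of $x_i^k(t)$.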
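
The two transfer functions and their binary operators can be sketched in Python as follows. The function names are hypothetical, the random number generator is passed in explicitly so that runs are repeatable, and the sigmoid rule follows (23) exactly as written above.

```python
import math
import random

def sigmoid_tf(v):
    """Eq. (22): map a velocity to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-v))

def v_shaped_tf(v):
    """Eq. (24): map a velocity to a probability in [0, 1), symmetric about 0."""
    return abs((2.0 / math.pi) * math.atan((math.pi / 2.0) * v))

def sigmoid_update(v, rng):
    """Eq. (23): the new bit depends only on the velocity."""
    return 0 if rng.random() < sigmoid_tf(v) else 1

def v_shaped_update(x, v, rng):
    """Eq. (25): flip the current bit with probability V(v), else keep it."""
    return 1 - x if rng.random() < v_shaped_tf(v) else x

rng = random.Random(0)
for v in (-2.0, 0.0, 2.0):
    print(v, round(sigmoid_tf(v), 3), round(v_shaped_tf(v), 3),
          sigmoid_update(v, rng), v_shaped_update(1, v, rng))
```

A zero velocity never flips the bit under the V-shaped rule, whereas the sigmoid rule still assigns 0 or 1 with equal probability.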


2.4. Proposed nearest integer-based binary bat algorithm (NIBBA) 

In this paper, the nearest integer discretization approach is used in the binary bat algorithm to
transform the continuous bat position into binary values. It was reported in [21] that this approach was
first introduced in [22] for solving reactive power and voltage control optimization problems. Later, [23]
employed the nearest integer discretization approach for the binary bat algorithm. Theoretically, this
approach involves rounding the continuous-valued bat position to the nearest integer for each feature or
dimension using (26):

$$x_i^k = \left\lfloor \left| x_i^k \bmod 2 \right| \right\rfloor \tag{26}$$

where $x_i^k$ is the position of the i-th bat for the k-th feature [24]. Firstly, the continuous-valued position
of the bat is divided by two. Then, the absolute value of the remainder is floored to obtain a binary value of
either 0 or 1. For instance, given a continuous-valued bat position of 3.45, the computational procedure
begins by dividing 3.45 by 2, which leaves a remainder of 1.45. The absolute value, |1.45|, is then floored,
⌊1.45⌋, resulting in the value of 1. The process is repeated for every feature or dimension until a binary
string of combinations is obtained. In contrast to the sigmoid and V-shaped transfer function approaches,
which depend on the bat's velocity in determining the binary position, the nearest integer method discretizes
the position itself. This, however, has been criticized because the solution obtained may not lie in the
optimal region, and the quality of the solution at the rounded point may not represent the original continuous
position [21]. The pseudocode of the proposed NIBBA is shown in Figure 1. The discretization of the
continuous bat position using the nearest integer scheme is performed at line 5 for the initial population’s
position and at line 16 for the new bat’s position.
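
The worked example for (26) can be reproduced with a few lines of Python. One assumption is made here: the remainder is taken with `math.fmod`, which keeps the sign of the dividend, so that the absolute value in (26) has an effect for negative positions. With a floor-based modulo (Python's `%`), negative positions would map differently. The function name `nearest_integer_bit` is hypothetical.

```python
import math

def nearest_integer_bit(x):
    """Eq. (26): floor of the absolute remainder of x divided by 2."""
    return int(math.floor(abs(math.fmod(x, 2.0))))

# Hypothetical continuous bat position, one value per feature.
position = [3.45, 0.7, -1.2, 2.0, 5.99]
bits = [nearest_integer_bit(x) for x in position]
print(bits)  # 3.45 -> remainder 1.45 -> 1, as in the text
```

Positions whose remainder lies in [1, 2) select the feature, and positions whose remainder lies in [0, 1) drop it.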


NIBBA Algorithm

 1: Initialize random continuous bat population, x_i (i = 1, 2, ..., population size)
 2: Initialize bat velocity, v_i = 0
 3: Initialize bat frequency, f_i at x_i
 4: Initialize bat pulse rate, r_i, and loudness, A_i
 5: Transform continuous-valued position, x_i, into a binary value using (26)
 6: Evaluate fitness
 7: Find the best: fitness, binary position, and continuous position, x_*
 8: While (iteration < Maximum iterations)
 9:    Generate new solutions:
10:       Adjust frequency: f_i = f_min + (f_max - f_min)β
11:       Update velocity: v_i^t = v_i^(t-1) + (x_i^(t-1) - x_*) f_i
12:       Update position: x_i^t = x_i^(t-1) + v_i^t
13:    if (rand > r_i)
14:       Select a solution among the best solutions
15:       Generate a local solution around the selected best solution
16:       Transform continuous-valued new position, x_i, into a binary value using (26)
17:    end
18:    Evaluate new fitness
19:    if (rand < A_i & f(x_i) < f(x_*))
20:       Accept the new solutions
21:       Increase pulse rate, r_i, and reduce loudness, A_i
22:    end
23:    Rank the bats and find the current best, x_*
24: end while


Figure 1. Pseudocode of the NIBBA algorithm
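
A compact Python sketch of the loop in Figure 1 is given below. It is a simplified reading under several assumptions, not the MATLAB implementation used in the study: the fitness is maximized, a candidate is accepted when it improves the bat's own fitness, every candidate is discretized with (26) before evaluation, the remainder in (26) is taken with `math.fmod`, and the names `nibba`, `to_binary`, `toy_fitness`, and `TARGET` are hypothetical. The toy fitness simply counts matches with a known mask, whereas in the proposed method it would be the integrated estimate SNR of the T-method model built from the selected features.

```python
import math
import random

def to_binary(x):
    """Eq. (26) applied to every dimension (truncated remainder assumed)."""
    return [int(math.floor(abs(math.fmod(v, 2.0)))) for v in x]

def nibba(fitness, n_features, pop=10, iters=100, f_min=0.0, f_max=2.0,
          A0=0.25, r0=0.5, alpha=0.9, gamma=0.9, seed=0):
    rng = random.Random(seed)
    x = [[rng.uniform(0.0, 2.0) for _ in range(n_features)] for _ in range(pop)]
    v = [[0.0] * n_features for _ in range(pop)]
    A = [A0] * pop                      # loudness per bat
    r = [r0] * pop                      # pulse rate per bat
    fit = [fitness(to_binary(xi)) for xi in x]
    b = max(range(pop), key=lambda i: fit[i])
    best_x, best_fit = x[b][:], fit[b]
    for t in range(1, iters + 1):
        mean_A = sum(A) / pop
        for i in range(pop):
            f_i = f_min + (f_max - f_min) * rng.random()                 # Eq. (16)
            v[i] = [vk + (xk - bk) * f_i
                    for vk, xk, bk in zip(v[i], x[i], best_x)]           # Eq. (17)
            cand = [xk + vk for xk, vk in zip(x[i], v[i])]               # Eq. (18)
            if rng.random() > r[i]:
                # Local random walk around the current best, Eq. (19)
                cand = [bk + rng.random() * mean_A for bk in best_x]
            cand_fit = fitness(to_binary(cand))
            # Assumed acceptance rule for a maximization problem
            if rng.random() < A[i] and cand_fit > fit[i]:
                x[i], fit[i] = cand, cand_fit
                A[i] = alpha * A[i]                                      # Eq. (20)
                r[i] = r0 * (1.0 - math.exp(-gamma * t))                 # Eq. (21)
            if fit[i] > best_fit:
                best_x, best_fit = x[i][:], fit[i]
    return to_binary(best_x), best_fit

# Hypothetical toy objective: number of bits that match a known mask.
TARGET = [1, 0, 1, 1, 0, 0, 1, 0]

def toy_fitness(bits):
    return sum(1 for b, t in zip(bits, TARGET) if b == t)

best_bits, best_value = nibba(toy_fitness, n_features=len(TARGET), seed=1)
print(best_bits, best_value)
```

Because the search runs in the continuous space and only the evaluation uses the discretized position, the velocity and random-walk updates are unchanged from the continuous bat algorithm.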

3. RESULTS AND DISCUSSION
3.1. Experimental design 

In determining the performance of the proposed T-method with the NIBBA algorithm, an
experimental study was conducted using four benchmark datasets, as shown in Table 1. The first three
datasets were obtained from the University of California at Irvine (UCI) machine learning repository [25],
while the body fat dataset was obtained from the Carnegie Mellon University StatLib repository [26]. The
datasets come from different case study domains, and each consists of multiple input features and a single
output with a different number of samples. Such variability was chosen deliberately to allow better
verification of the proposed method under varying problem settings and conditions. Hold-out validation is
applied to the datasets, where 70% of the data is used as the train data set for learning the prediction
model, while the remaining 30% is allocated as the validation data set to verify the effectiveness of the
model on unseen data. Specifically for this study, five unit space data were selected and removed from the
raw train data. As such, the total number of signal data is five less than the total number of the raw
train data set.
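
The sample counts that follow from this split can be checked with a small Python sketch. The rounding rule is an assumption inferred from Table 1 (the 30% validation share is rounded down and the remainder goes to the train set), and the function name `split_sizes` is hypothetical.

```python
def split_sizes(n_samples, n_unit_space=5):
    """Return (train, signal, validation) counts for a 70/30 hold-out split."""
    n_valid = n_samples * 30 // 100       # 30% validation share, rounded down (assumed)
    n_train = n_samples - n_valid         # remaining samples form the train set
    n_signal = n_train - n_unit_space     # unit space data are removed from the train set
    return n_train, n_signal, n_valid

for name, n in [("Abalone", 4177), ("Concrete", 1030),
                ("Cooling load", 768), ("Body fat", 252)]:
    print(name, split_sizes(n))
```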

Prior to verifying the effectiveness of the proposed method in feature selection optimization, the
basic T-method prediction model involving all features was developed using the normalized signal data set.
The model parameters required in formulating the prediction model, namely the proportional coefficient, β,
and SNR, η, were computed for each dataset. Table 2 shows an example of the proportional coefficient, β, and
SNR, η, computed for the cooling load dataset. Any feature with a negative SNR value is assigned a value of
zero, as shown for feature X6 in Table 2. This is in accordance with the condition given
in (4). Next, the optimal parameter setting for the NIBBA is determined for each dataset. Specifically, for
this experimental study, the Taguchi method was employed, utilizing an L27 orthogonal array to design the
experiment involving six parameters with three levels, as shown in Table 3. Each parameter’s levels were
obtained from past studies employing the binary bat algorithm. For each combination in the
L27 array, the experimental run was repeated three times, and the average value of the integrated estimate
SNR (db) was used as the response to determine each parameter's significant level, except for the Abalone
dataset. Since every combination in the experimental run for the Abalone dataset converged to the optimal
objective function value, which is the maximum value of the integrated estimate SNR, the response for the
Abalone dataset was set as the convergence rate computed using (27). Using Minitab software, the level of
each parameter with the highest signal-to-noise ratio was selected as the optimal parameter setting. The
optimal parameter setting for each dataset is shown in Table 4. Next, to obtain the optimal input features
using the proposed T-method with the NIBBA approach under the optimal parameter setting, 20 independent runs
were executed with 500 internal iterations for each run. The objective function of the proposed T-method with
the NIBBA algorithm is to maximize the integrated estimate SNR value, which returns the optimal combination
of input features. The final optimal feature combination consists of the features that appear in the optimal
combination of at least 50% of the runs, as practiced in [14].
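
The 50% appearance rule for combining independent runs can be sketched as follows in Python. The run results are hypothetical (four runs instead of 20, to keep the example short), and the function name `vote_features` is an assumption.

```python
def vote_features(runs, threshold=0.5):
    """Keep features that appear in at least `threshold` of the per-run optima.
    runs: list of binary vectors, one optimal combination per independent run."""
    n_runs = len(runs)
    n_features = len(runs[0])
    freq = [sum(run[j] for run in runs) / n_runs for j in range(n_features)]
    selected = [j + 1 for j, f in enumerate(freq) if f >= threshold]
    return selected, freq

# Hypothetical optimal combinations returned by four independent runs.
runs = [
    [1, 0, 1, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 0],
    [0, 0, 0, 0],
]
selected, freq = vote_features(runs)
print(selected, freq)
```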


Table 1. Benchmark datasets

Dataset        No. of samples   No. of features   Train data (70%)   Signal data   Validation data (30%)
Abalone        4177             7                 2924               2919          1253
Concrete       1030             8                 721                716           309
Cooling load   768              8                 538                533           230
Body fat       252              14                177                172           75


Table 2. Computed model parameters for cooling load dataset

Parameter     X1      X2       X3      X4       X5      X6       X7       X8
β             0.007   -6.342   1.772   -4.057   0.166   0.001    0.003    0.012
η             0.005   0.005    0.002   0.011    0.007   -1.897   0.0001   2.404
Corrected η   0.005   0.005    0.002   0.011    0.007   0        0.0001   2.404


Table 3. Experimental parameters settings

Parameter         Level 1   Level 2   Level 3
Population size   10        25        30
Frequency         [0, 1]    [0, 2]    (0.8, 1]
Loudness          0.25      0.5       0.75
Pulse rate        0.01      0.5       0.9
Alpha, α          0.1       0.7       0.9
Gamma, γ          0.6       0.9       0.95

Table 4. NIBBA parameters setting

Parameter             Abalone   Concrete   Cooling load   Body fat
Population size       25        10         10             25
Min frequency, f_min  0         0          0              0
Max frequency, f_max  2         2          2              1
Loudness, A_0         0.5       0.25       0.25           0.75
Pulse rate, r_0       0.5       0.9        0.5            0.01
Alpha, α              0.9       0.1        0.1            0.9
Gamma, γ              0.95      0.6        0.6            0.6


$$\text{Simple convergence rate, } C_r = \frac{Iteration_{max} - Iteration_{start\ converge}}{Iteration_{max}} \times 100\% \tag{27}$$

Upon obtaining the optimal subset of input features, a feature reduction rate is computed using (28)
to evaluate the reduction ability of the feature selection [27]. Subsequently, the prediction accuracy is
computed using the mean absolute error (MAE) as the performance measure, quantifying the model’s
accuracy on the validation data set, as shown in (29).

$$\text{Reduction rate, } R_r = \frac{\#\,\text{original features} - \#\,\text{selected features}}{\#\,\text{original features}} \times 100\% \tag{28}$$

$$\text{Mean absolute error (MAE)} = \frac{1}{l} \sum_{i=1}^{l} \left| M_i - \hat{M}_i \right| \tag{29}$$

where $M_i$ is the actual output of the validation data set, $\hat{M}_i$ is the predicted output value for the
validation data set, and l is the number of validation data. A comparison study is then conducted to compare
the prediction performance between the T-method with full features, the T-method with OA, and the T-method
with NIBBA. Finally, hypothesis testing using a paired t-test is performed to determine
whether there are significant differences between the mean outcome of each approach, as formulated in (30).
The p-value is used to decide whether to reject the null hypothesis at a significance level of
0.05 (5%) [28]. Throughout the study, MATLAB R2020a was used to construct and execute the algorithms on a
laptop computer powered by an Intel Core i5-8250U central processing unit, with 4 gigabytes of
random-access memory and 1 terabyte of storage capacity.


$$\begin{aligned} H_0 &: \mu_1 = \mu_2 \quad \text{(the means are equal; no difference)} \\ H_1 &: \mu_1 \neq \mu_2 \quad \text{(the means are not equal; difference)} \end{aligned} \tag{30}$$
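
The evaluation measures in (28) and (29) and the test statistic behind (30) can be sketched in Python as below. The function names and the small sample vectors are hypothetical. Only the paired t statistic is computed here, since the p-value additionally requires the t distribution with n - 1 degrees of freedom, which a statistics package would normally provide.

```python
import math

def reduction_rate(n_original, n_selected):
    """Eq. (28): percentage of features removed by the selection."""
    return (n_original - n_selected) / n_original * 100.0

def mean_absolute_error(actual, predicted):
    """Eq. (29): average absolute deviation between actual and predicted outputs."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

def paired_t_statistic(a, b):
    """t statistic of the paired differences a_i - b_i for the hypotheses in (30)."""
    d = [x - y for x, y in zip(a, b)]
    n = len(d)
    mean_d = sum(d) / n
    var_d = sum((di - mean_d) ** 2 for di in d) / (n - 1)   # sample variance
    return mean_d / math.sqrt(var_d / n)

# Reduction rates for the Abalone and body fat feature counts in Table 5.
print(round(reduction_rate(7, 3), 1), round(reduction_rate(14, 3), 1))

# Hypothetical actual and predicted outputs.
print(mean_absolute_error([1.0, 2.0, 3.0], [1.5, 2.0, 2.0]))
print(paired_t_statistic([1.0, 2.0, 3.0, 4.0], [0.0, 0.0, 2.0, 2.0]))
```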


3.2. Experimental results 

Table 5 shows the optimal number of features with their respective combination (in parentheses)
and reduction rate, R_r, for all three approaches obtained using the train data set. By definition, for the
T-method with full features, all original features are used in formulating the prediction model. In contrast,
the T-method with OA and the T-method with NIBBA recorded a reduction in the total number of input
features, with different optimal feature combinations. In general, a reduction in the total number of features
indicates that a less complex T-method prediction model was achieved through the T-method with OA and the
T-method with NIBBA, which potentially improves the prediction accuracy and shortens the computation time.
It can also be concluded that both the T-method with OA and the T-method with NIBBA
successfully identify insignificant features and offer only a subset of significant features to be incorporated
in the prediction model based on their respective methodology.

Table 6 shows the prediction accuracy in terms of the MAE value for the three approaches, obtained
using the optimal feature combination on the validation data set. The percentage value in parentheses is
the percent enhancement of the T-method with OA and the T-method with NIBBA against the T-method with
full features. The proposed T-method with NIBBA recorded the best MAE value for all case
studies, indicating the optimality of the prediction model achieved when only significant features were
incorporated in the model. Nevertheless, the conventional approach also recorded an enhancement in the
prediction accuracy, suggesting that a feature selection process is important for a multivariate dataset. From
the perspective of the ability to obtain the model’s optimality, when more combinations are generated and
analyzed, as in the NIBBA, the probability of obtaining an optimal feature subset that contributes to the
enhancement of the T-method prediction accuracy is greater than with the conventional fixed and limited
set of combinations.

Table 7 shows the result of paired t-test hypothesis testing to determine whether there are significant
differences in the mean of the predicted outcome between the T-method with OA and the T-method with NIBBA
using the validation data set. The recorded p-value for the concrete, cooling load, and body fat
datasets was less than the significance level of 0.05, resulting in the rejection of the null hypothesis (H0).
Thus, it can be concluded that there were significant differences between the outcomes of both approaches.
However, for the Abalone dataset, the p-value of more than 0.05 means that the null hypothesis could not be
rejected, suggesting that the differences between both approaches are not statistically significant.


Table 5. Optimal features combination

Dataset        Item                  T-method + full features   T-method + OA        T-method + NIBBA
Abalone        No. of features       7                          3                    2
               Optimal combination   all                        (2, 3, 7)            (3, 7)
               Reduction rate, R_r   -                          57.1%                71.4%
Concrete       No. of features       8                          6                    6
               Optimal combination   all                        (1, 3, 4, 5, 7, 8)   (1, 2, 4, 6, 7, 8)
               Reduction rate, R_r   -                          25.0%                25.0%
Cooling load   No. of features       8                          4                    4
               Optimal combination   all                        (3, 4, 7, 8)         (1, 3, 7, 8)
               Reduction rate, R_r   -                          50.0%                50.0%
Body fat       No. of features       14                         4                    3
               Optimal combination   all                        (1, 2, 4, 7)         (1, 2, 7)
               Reduction rate, R_r   -                          71.4%                78.6%

Table 6. Prediction accuracy (MAE) on the validation data set

Dataset        T-method + full features   T-method + OA   T-method + NIBBA
Abalone        3.65                       3.23 (11.5%)    3.16 (13.4%)
Concrete       16.36                      16.15 (1.0%)    15.19 (7.2%)
Cooling load   8.08                       5.95 (26.4%)    5.40 (33.2%)
Body fat       0.40                       0.25 (37.5%)    0.18 (55.0%)


Table 7. T-test result

Dataset        T-value   p-value
Abalone        1.89      0.059
Concrete       -5.57     0.000
Cooling load   -5.24     0.000
Body fat       -4.29     0.000


4. CONCLUSION 

This paper proposed the integration of the nearest integer-based binary bat algorithm with Taguchi’s
T-method for feature selection optimization. The results from the experimental study using benchmark
datasets show that the integration is feasible and that the NIBBA algorithm is capable of searching for the
optimal feature subset affecting the T-method prediction accuracy. It was observed that the process of feature
selection, either through the conventional OA or the NIBBA algorithm, successfully discarded insignificant
features, resulting in a less complex model. Moreover, the T-method with NIBBA obtained a better
subset of optimal features than the conventional T-method with OA, based on the MAE results
recorded using the validation data set. In addition, the results from paired t-test hypothesis testing indicate
that the difference between the outcomes of the T-method with OA and the T-method with NIBBA is statistically
significant and not due to chance, except for the Abalone case study. In general, it was
found that the recursive and adaptive features embedded in the binary bat algorithm drive the search for an
optimal solution. As for the nearest integer discretization approach, it allows the bat to globally explore the
continuous-valued solution space and exploit the continuous-valued best solution during local search, since
the discretization is executed at the destination, that is, the bat’s position. Furthermore, NIBBA operates at
a low computational cost. In conclusion, the integration of the T-method and the NIBBA algorithm is
beneficial, offering better T-method prediction accuracy.